Clustering in the membership embedding space

نویسندگان

  • Maurizio Filippone
  • Francesco Masulli
  • Stefano Rovetta
چکیده

In several applications of data mining to high-dimensional data, clustering techniques developed for low-to-moderate sized problems obtain unsatisfactory results. This is an aspect of the curse of dimensionality issue. A traditional approach is based on representing the data in a suitable similarity space instead of the original high-dimensional attribute space. In this paper, we propose a solution to this problem using the projection of data onto a so-called Membership Embedding Space obtained by using the memberships of data points on fuzzy sets centered on some prototypes. This approach can increase the efficiency of the popular Fuzzy C-Means method in the presence of high-dimensional data sets, as we show in an experimental comparisons. We also present a constructive method for prototypes selection based on simulated annealing that is viable for semi-supervised clustering problems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Membership Embedding Space Approach and Spectral Clustering

The data representation strategy termed “Membership Embedding” is a type of similarity-based representation that uses a set of data items in an input space as reference points (probes), and represents all data in terms of their membership to the fuzzy concepts represented by the probes. The technique has been proposed as a concise representation for improving the data clustering task. In this c...

متن کامل

Detecting Overlapping Communities in Social Networks using Deep Learning

In network analysis, a community is typically considered of as a group of nodes with a great density of edges among themselves and a low density of edges relative to other network parts. Detecting a community structure is important in any network analysis task, especially for revealing patterns between specified nodes. There is a variety of approaches presented in the literature for overlapping...

متن کامل

A new method for fuzzification of nested dummy variables by fuzzy clustering membership functions and its application in financial economy

In this study, the aim is to propose a new method for fuzzification of nested dummy variables. The fuzzification idea of dummy variables has been acquired from non-linear part of regime switching models in econometrics. In these models, the concept of transfer functions is like the notion of fuzzy membership functions, but no principle or linguistic sentence have been used for inputs. Consequen...

متن کامل

بررسی کاربرد روش فازی (Fuzzy) در طبقه‌بندی خاک‌ها، مطالعه موردی: چشمه سفید کرمانشاه

Chenges in the soil characteristics is rather continuously. A method that takes this continuity into account would present a realistic pattern of soil distribution either in taxonomic or geographical space. The fuzzy set theory provides such an approach. In this study, the robustness of fuzzy clustering in soil pattern recognition was evaluated in a subcatchment of western Iran. The clustering ...

متن کامل

بررسی کاربرد روش فازی (Fuzzy) در طبقه‌بندی خاک‌ها، مطالعه موردی: چشمه سفید کرمانشاه

Chenges in the soil characteristics is rather continuously. A method that takes this continuity into account would present a realistic pattern of soil distribution either in taxonomic or geographical space. The fuzzy set theory provides such an approach. In this study, the robustness of fuzzy clustering in soil pattern recognition was evaluated in a subcatchment of western Iran. The clustering ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IJKESDP

دوره 1  شماره 

صفحات  -

تاریخ انتشار 2009